How a protein searches for its specific site on DNA: the role of intersegment transfer 
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Proteins are known to locate their specific targets on DNA up to two orders of magnitude faster 
than predicted by the Smoluchowski three-dimensional diffusion rate. One of the mechanisms pro- 
posed to resolve this discrepancy is termed "intersegment transfer" . Many proteins have two DNA 
binding sites and can transfer from one DNA segment to another without dissociation to water. 
We calculate the target search rate for such proteins in a dense globular DNA, taking into account 
intersegment transfer working in conjunction with DNA motion and protein sliding along DNA. We 
show that intersegment transfer plays a very important role in cases where the protein spends most 
of its time adsorbed on DNA. 



I. INTRODUCTION 

The description of how proteins interact with specific 
sites on DNA is of fundamental importance to molecular 
biology. The effectiveness of a DNA enzyme depends 
entirely on its ability to locate its target site quickly and 
reliably. It was recognized long ago that the search by 
free diffusion through three-dimensional (3D) solution is 
far too slow to account for the observed speed of many 
biological processes, and that proteins somehow arrive 
at their target sites up to two orders of magnitude faster 
than the 3D Smoluchowski rate p], Q 



J, = AirDsbc, 



(1) 



where b is the target radius and D 3 and c are, respec- 
tively, the diffusion coefficient and concentration of pro- 
teins in solution. The idea to resolve this discrepancy 
goes back to Delbriick [3|, who suggested that proteins 
may adsorb fairly quickly onto a nonspecific random 
place on DNA and then undergo one-dimensional (ID) 
sliding along the DNA strand, resulting in an increase 
of the search rate J above J s . Equivalently, we can say 
that the average search time for the proteins t — 1/ J falls 
below the Smoluchowski time t s = 1/ J s . Below we char- 
acterize this rate enhancement by the acceleration ratio 

ts/t. 

The field attracted intensive attention for many years. 
On the theoretical front, the pioneering work by Berg, 
Winter and von Hippel [4[ established the basis of current 
understanding in this field. They showed that ID sliding 
on DNA forms a kind of "antenna" around the target 
site and serves to increase the effective size of the target. 
This large antenna size replaces the actual target size b 
in Eq. (fTJ), resulting in a much faster search rate. The 
Berg, Winter and von Hippel model predicts that the 
rate at which proteins find their specific target sites on 
DNA depends in a nonmonotonic fashion on the ionic 
strength of the solution, which seems to be qualitatively 
consistent with experiments. 

In recent years, the sliding mechanism has been revis- 
ited several times 0, [l(|, but the question of how the 
protein search time depends on DNA conformation was 
not addressed. It is well known that DNA is coiled at 



length scales larger than its persistence length. When 
the coil cannot fit in the volume available, e.g. in the 
nucleoid in a prokaryotic cell, it must be a globule, as it 
is forced to fold back into the volume after each contact 
with the walls. Locally, the globule resembles a transient 
network with a certain mesh size (sec Fig. [TJ). A scaling 
theory was recently proposed to account for the role of 
different DNA conformations [l2| ■ This theory deals only 
with proteins with a single DNA binding site and ignores 
the motion of DNA in solution. Our goal in this paper is 
to relax these restrictions. 




FIG. 1: A DNA globule. The long DNA is forced to return 
many times by the wall of a prokaryotic cell. On the right, a 
blown-up view shows a typical region of the transient network 
at a scale much smaller than the DNA persistence length p. 
This figure represents a very dense case, where the nearest 
neighbor distance between DNA segments, each of length p, 
is shorter than p. 

Berg, Winter and von Hippel [i[ pointed out that in ad- 
dition to ID sliding, proteins with two nonspecific DNA 
binding sites may benefit from another facilitating mech- 
anism termed "intersegment transfer" . Indeed, such pro- 
teins are capable of transiently binding to two DNA seg- 
ments when the segments are close in space, even if they 
are well separated from each other along the DNA con- 
tour. The subsequent segmental diffusion of DNA then 
disrupts these double-bound states, resulting in the pro- 
tein being transferred to a remote position on the DNA 
without net dissociation of protein into the water. 

The existence of intersegment transfer in principle has 
been confirmed by a number of well-designed in vitro ex- 
periments [1, H, Q • These experiments measured the dis- 
sociation rate of proteins from a prepared complex of 
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the protein and a short piece of specific DNA. The com- 
plex was placed in a solution of short nonspecific DNA 
molecules, and the dissociation rate was measured as a 
function of the concentration of nonspecific DNA. All the 
proteins used in [EBB], namely lac repressor [j| , gluco- 
corticoid receptor DNA-binding domain protein [6( and 
human Hox-D9 homeodomain [7| are believed to have two 
DNA binding sites so that the protein-DNA complex can 
adsorb a second short piece of DNA, allowing the protein 
to transiently form a double-bound state with two DNA 
pieces. This double-bound state breaks up quickly (faster 
than the dissociation of protein to water in the prepared 
protein-DNA complex) and the protein has a chance to 
be transferred to the newly adsorbed DNA. As a result, 
the dissociation rate of the complex grows linearly with 
the concentration of nonspecific DNA. This phenomenon 
of inter-DNA transfer is essentially similar to the inter- 
segment transfer of proteins inside a single long DNA 
strand. Direct observation of intersegment transfer was 
also achieved by a scanning force microscopy study of the 
translocation of RNA polymerase in E. coli Q. 

In this paper, we propose a scaling theory of the tar- 
get search time for proteins with two DNA binding sites, 
which combines the effects of 3D diffusion, ID sliding, 
intersegment transfer and DNA motion. Our main inter- 
est is the search time for the biologically relevant case of 
globular DNA. However, its complex geometrical prop- 
erties combined with the several mechanisms of protein 
motion make this problem very complicated. Therefore, 
we start from a relatively simple case, namely the search 
time in a solution of short, straight double- helix DNA 
molecules among which only a small fraction carry the 
specific targets. In this situation we are able to include 
the effects of intersegment transfer and establish connec- 
tions with the in vitro experiments on short DNA [^,0,0|- 
Our analysis of this case is detailed in Sec. II, and a sum- 
mary of the resulting scaling regimes is shown in Fig. [3] 

In Sec. Ill we apply the methods developed for short 
DNA pieces to the case of a very dense DNA globule 
as shown in Fig. [TJ The acceleration rate t s /t is shown 
schematically as the solid line in Fig. [21 plotted as a func- 
tion of y = exp(e/fcsT), where e is the nonspecific ad- 
sorption energy of the protein to DNA. Experimentally, 
the value of y can be controlled through the salt con- 
centration of the solution, since non-specific absorbtion 
of proteins is controlled by Coulomb interaction between 
negative DNA and the positive patch on the protein sur- 
face and may be screened by salt concentration. For com- 
parison, we also plot the result of Ref. [e| , which ignores 
DNA motion and intersegment transfer, as a dashed line. 
In the latter case, the acceleration rate grows first with 
y because protein binding to DNA increases the antenna 
size; then the acceleration rate decays when most of the 
proteins are fruitlessly adsorbed far from the target (or, 
in other words, every protein spends most of the time 
adsorbed far away from the target). Finally, the acceler- 
ation rate saturates and comes to a very low plateau (not 
shown) when the antenna becomes as long as the DNA it- 



self. Hence, when DNA motion and intersegment transfer 
are not accounted for, there is a very strong deceleration 
at large ionic strength compared to the Smoluchowski 
rate. With the help of intersegment transfer, however, 
the acceleration rate saturates at a much higher level 
(larger than unity) because adsorbed proteins become 
much more effective in target search. 




FIG. 2: Schematic dependencies of the acceleration rate on 
the adsorption strength y, with (solid line) or without (dashed 
line) DNA motion and intersegment transfer. Both the accel- 
eration rate t s /t and y are given in logarithmic scale. 

In Sec. IV, we conclude with a discussion of the appli- 
cability of our model and a comparison to the previous 
theory Q. 



II. SIMPLE CASE: DNA IS SHORT 
A. Model and approach 

We assume that within some volume v a number of 
short, rigid (double helix) DNA molecules of length I are 
confined, among which only one piece of DNA contains 
a target site of size b. We call this molecule the spe- 
cific DNA while others are called nonspecific DNA. The 
system considered here is equivalent to an in vitro ex- 
periment with specific DNA concentration 1 jv and much 
larger nonspecific DNA concentration N. 

We further assume that a protein can be adsorbed non- 
spccifically on DNA, and that the nonspecific adsorption 
energy e, or the corresponding constant y = exp(e/fcsT), 
is the same everywhere on the DNA and does not depend 
on the DNA sequence. The only exception is at the target 
site on the specific DNA, where the binding energy is 
much larger. We assume that every protein has two sites 
capable of binding to DNA, so that the protein can be 
bound to two DNA molecules at the same time. 

A non-specifically bound protein is assumed to diffuse 
(slide) along DNA with the diffusion coefficient D\ , while 
protein dissolved in the surrounding water diffuses in 3D 
with diffusion coefficient D%. In the simplest version of 
the theory, we assume D\ — D3 — D. While the protein 
is diffusing, the DNA molecule itself diffuses through wa- 
ter with diffusion coefficient D t . Following the Stokes- 
Einstein relation, D t ~ D(b/l), where b is the size of the 
protein. 

The quantity of our interest is the mean time t needed 
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for the target site to be found by a protein. We want to 
look at the situation in terms of a single protein diffusing 
to its target. In this view, one should imagine that a 
protein molecule is initially introduced into a random 
place within volume v (thus the protein concentration c 
is 1/v), and then ask how fast the protein diffuses to its 
target site on the specific DNA. In order to compare the 
predicted time t to the Smoluchowski time t s — 1/J S — 
1/AnDcb, we shall mainly look at the acceleration rate 



t. 



1 



t t{AirDcb) tDb' 



(2) 



We note that in our scaling theory we drop away both 
all numerical factors and all logarithmic correction fac- 
tors, which exist in the problem because it deals with 
strongly elongated cylinders. In this context, we will use 
the symbol " ~ " to mean "equal up to a numerical co- 
efficient of order one" , while symbols < and > mean < < 
and >>, respectively. Along with these simplifications, 
we also make several assumptions driven by pure desire 
to make formulae simpler and to clarify major physical 
ideas. We assume that all "microscopic" length scales are 
of the same order, namely, the target size b: protein di- 
ameter, double helical DNA diameter, and the distance 
from DNA at which nonspecific absorbtion takes place 
are all considered to be roughly equal to b. 



B. Search time 

Let us imagine for a moment that there is no interseg- 
ment transfer, as is the case for a protein with only one 
nonspecific DNA binding site. One protein is introduced 
into the volume v. The ensuing search process for the 
given single protein consists of tours of ID sliding along 
the nonspecific DNA followed by 3D diffusion in water, 
followed by ID sliding, and so on. On its way to the 
target on the specific DNA, the protein will go through 
many adsorption and desorption cycles, and therefore the 
ratio of the typical time for the protein to be adsorbed, 
t a , and desorbed, td, in a cycle should simply follow the 
equilibrium Boltzmann statistics: 



— ~ y(Nlb 2 ) 
td 



(3) 



The diffusion time in water per cycle td can be es- 
timated as the time a protein needs to find a DNA 
molecule and bind nonspecifically to it. Using Eq. fl}, 
t d ~ l/(l/v)D(Nv)l ~ l/NDl, where 1/v stands for the 
protein concentration c and Nv is the number of DNA 
molecules in the volume v. As a result t a ~ tdy(Nlb 2 ) ~ 

y(b 2 /D). 

Let x be the average length of DNA searched by the 
protein per cycle. Then in order to find the specific site 
(target) among the total Nvl/b sites on DNA, the pro- 
tein should perform such searching cycles roughly Nvl / x 



times. Therefore the search time is given by 

Nul V 

t (t a +t d ) = — (1 + yNlb 2 ). (4) 

x Vx 

Plugging t into Eq. we obtain the acceleration rate 
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t. 



t 1 + yNlb 2 b ' 



(5) 



We can consider two limiting cases to find expressions for 
x. At y < l 2 /b 2 , x ~ (Dta) 1 / 2 ~ y x l 2 b is just the sliding 
distance of the protein on one DNA molecule, while at 
y > l 2 /b 2 , x is limited to the total length of DNA I, There 
are also two limiting cases for the denominator of Eq. {5} . 
When y is relatively small so that y < 1/Nlb 2 , i. e. the 
protein spends most of its time desorbed in water, the 
first term dominates. At y > 1/Nlb 2 , the protein spends 
most of its time adsorbed and the second term domi- 
nates. As a result we obtain four scaling regimes shown 
in the phase diagram of Fig. Eta). We terminate the 
phase diagram at the concentration Nl 3 = l/b because 
in a denser system liquid crystalline nematic ordering of 
DNA molecules becomes likely. The dependencies of the 
acceleration rate on y for a semi-dilute solution of short 
DNA pieces are plotted in Fig. 0Ja). The search rate is 
shown to increase at first due to the increase of x, and 
then decrease due to the fact that at large y the protein 
spends most of the time adsorbed on nonspecific DNA 
molecules, which slows down the diffusion to the target. 

Now let us move on to the case of a protein with two 
DNA binding sites. When a piece of DNA with an ad- 
sorbed protein collides with another DNA molecule, the 
protein has some probability to move directly to the new 
molecule. If the inter-DNA transfer is faster than the 
dissociation of protein into water, i.e. if the average time 
Tt required for a protein to be transferred from one piece 
of DNA to another is shorter than the adsorption time 
t a , then the protein can explore several DNA molecules 
during t a . As a result, the protein can visit a large num- 
ber of different sites during adsorption and the efficiency 
of ID search on DNA is greatly enhanced. We find be- 
low that at large y when r t < t a and inter-DNA transfer 
dominates, the protein already spends most of the time 
adsorbed and t a > t<$. Therefore we neglect the time 
spent in water td and redefine x/b as the number of dif- 
ferent sites explored on the same DNA during Tt- The 
search time can then be estimated as 



t 



Nvl 



so that we obtain the acceleration rate 



x 



t 



NDlbr, 



(6) 



(7) 



The results are shown in the diagram Fig. [31(b). At 
large y the acceleration rate stops decreasing with y and 
saturates. 



4 



1/yNb 3 




l/b Nl a 




(a) 



(b) 



FIG. 3: "Phase diagram" for the acceleration rate t s /t in the plane of y and Nl 3 , where I is held constant. Both the y and 
Nl 3 axes are in the logarithmic scale. The ratio t s /t is shown in black on the background of each region, (a) shows scaling 
dependencies without inter-DNA transfer; (b) gives results with inter-DNA transfer, where t s /t saturates at large y. 



We begin explaining our results by calculating the most 
important quantity of our theory: Tt. For a dilute solu- 
tion of DNA molecules with Nl 3 < 1, one can use Eq. ((T|) 
to find the time for a given DNA molecule to enter the 
spherical region occupied by another piece of DNA by re- 
placing D 3 and b by the DNA diffusion coefficient D(b/l) 
and the length I. The result is l/D(b/l)Nl = l/DNb. 
When the given DNA molecule enters the sphere of an- 
other molecule and diffuses over distance I, on average 
every site on the DNA has a chance to collide with the 
second DNA before it leaves the sphere. As a result, a 
protein adsorbed on one DNA can essentially always get 
transferred to the new one during a collision. Since in 
a dilute solution the diffusion time to find such a sphere 
containing a second DNA piece, l/DNb, is larger than 
the diffusion time within the sphere, l 2 /D(b/l) ~ l 3 /Db, 
the transfer waiting time Tt is the order of l/DNb. Be- 
cause Drt > I 2 , the protein searches l/b different sites 
during r t and x ~ I. Using Eq. ([6]), we obtain the search 
time 

Nvl v 
t~— Tt ~ — ~t.. (8) 



When Nl 3 > 1, the spheres containing individual DNA 
molecules strongly overlap. In such a semi-dilute solu- 
tion, the first collision for a given DNA molecule hap- 
pens when it diffuses over the nearest neighbor distance 
r p . One can find r p by constructing an imaginary cylin- 
der with radius r p around each DNA molecule, where the 
length of the molecule serves as the cylinder's axis. Be- 
cause the excluded volume of a cylinder is ~ l 2 r p , the 
radius r p should satisfy Nl 2 r p ~ 1 and thus scale as 
1/Nl 2 . During time 
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the DNA diffuses over a distance r p , giving every site on 
some segment of the DNA of length r p the opportunity to 
collide once with the nearest-neighboring DNA (see Fig. 
[5]). After time r, the diffusing DNA and its neighbors 
have moved around enough that the nearest-neighboring 
region (shown by a dashed circle) may be considered to 
have shifted to a random place on the DNA. 

Let us assume that the protein has just arrived at some 
place on the given DNA molecule. In order to be trans- 
ferred to another DNA within time r, the protein must 
reach the segment of length r p (see Fig. [5J during r. 
Since the typical distance between the adsorbed protein 
and the nearest neighboring region is just proportional 
to the DNA length I, the protein will change molecules 
during r when Dt > I 2 , or Nl 3 < (l/b) 1 / 2 . Therefore 
Tt ~ t and we obtain the search time 



t ~ Nvt, 



1 



t 



Nl 3 Db Nl 3 ' 



^ (io) 



D(b/l) DN 2 l 3 b' 



(9) 



from which we can see that the search rate saturates Nl 3 
times faster than the Smoluchowski rate, and that the 
acceleration rate grows with DNA concentration since 
denser solution makes inter-DNA transfer easier. 

When Nl 3 > (l/b) 1/2 , the ID sliding distance of pro- 
tein on a single DNA molecule during t is x ~ (Dt) 1 / 2 < 
I. Therefore, the probability for the protein to reach 
the nearest neighboring region on the DNA during r is 
x/l ~ (l/b) 1 / 2 /Nl 3 < 1. In this case the transfer waiting 
time Tt > t, and it should be calculated self-consistcntly. 
During r t the sliding distance x of the protein is (Drt) 1 / 2 , 
so the probability for the protein to reach a specified 
nearest neighboring region is on the order of (£>r t ) 1/2 /I. 
Since the nearest-neighboring region changes to a random 
place on the DNA after r, there are r t /r such regions dur- 
ing time r. Therefore the probability for the protein to 
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1<NI J < l/b 



(a) 




As a result, 
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(b) 




1/Nlb 2 l 2 /b 2 



(c) 
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l/b- 



(l/b) 1/2 < Nl 3 < l/b 
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2n4/3 



y 



FIG. 4: Schematic dependencies of acceleration rate on y for a 
semi-dilute (Nl 3 > 1) solution of short DNA pieces, (a) with- 
out inter DNA transfer; (b) and (c) with inter DNA transfer. 
The fraction next to each curve shows its slope (the power 
dependence of t 3 /t on y). 




FIG. 5: Collision of a DNA molecule with its nearest neighbor 
at distance r v (other DNA molecules are not shown). 



reach any one of these regions and then get transferred 
should satisfy 



I T 



1. 



(11) 



D(N 2 l 2 b) 2 / 3 ' 



(12) 



and the search time is given by 



t 



Nvl 



r t ~ (Nlb 2 ) 1 ' 3 ^- ~ (Nlb 2 y/h s . (13) 



Db 



The equations for the crossover lines at large y, shown 
in Fig. Hfb), are obtained by equating r t to t a . This 
condition determines the range of parameters for which 
intersegment transfer takes over, i.e. when the time it 
takes the protein to transfer between DNA molecules is 
much shorter than the time the protein spends adsorbed 
on the DNA then we can say that intersegment transfer 
is the dominant mechanism. The dependencies of the 
acceleration rate on y for semi-dilute DNA concentrations 
with Nl 3 > 1 are schematically plotted in Fig. 0] (b) and 
(c). For the purpose of comparison, we also show the 
dependencies for proteins with a single binding site in 
Fig. [1(a). 

A new feature shown by Fig. H (b) and (c) is that, for 
proteins with two DNA binding sites, inter-DNA trans- 
fer stops the search rate from decreasing and causes it 
to saturate at large y. It can be shown from equations 
©, l|10p and l|13p that the acceleration rate is constant 
and ~ 1 when the solution is dilute and y is large. The 
acceleration rate begins to grow as the concentration is 
increased past Nl 3 ~ 1, peaking when Nl 3 ~ (l/b) 1 / 2 and 
achieving a maximum value of (l/b) 1 / 2 . After the peak, 
it decreases again and reaches (l/b) 1 / 3 when Nl 3 ~ l/b. 

Before we move on to next section, we should empha- 
size that in our calculation we have completely neglected 
the energy barrier associated with breaking the double- 
bound state. We have assumed the barrier to be so small 
that the lifetime of the double-bound state is a small 
correction to the above calculated Tt- The search time 
we have found is therefore the lower limit which can be 
achieved with the help of intersegment transfer. In Sec. 
IV, we will return to this issue in more detail. 



C. Dissociation rate 

Since in experiments @,@,0] the role of intersegment is 
inferred from measuring the dissociation rate of the pre- 
pared protein-DNA complex, in this section we calculate 
this rate for a protein adsorbed on a nonspecific piece 
of DNA dissociating to other nonspecific DNA pieces via 
inter-DNA transfer. [H[ 

The calculation is quite straightforward and the results 
are presented in the phase diagram of Fig. [6] The appar- 
ent dissociation rate is just l/t a + l/r t , where each term 
represents a possible relaxation process undergone by the 
adsorbed protein: either dissociation to water or inter- 
segment transfer to another piece of DNA. The faster 
process dominates the rate. Since the dissociation rate 
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to water decreases with the adsorption strength y and 
the intersegment transfer rate grows with the nonspecific 
DNA concentration, intersegment transfer dominates the 
apparent dissociation rate at relatively large y and A. 
We find that the enhanced dissociation rate grows lin- 
early with nonspecific DNA concentration A when the 
solution is dilute, in agreement with the experiments 
[S S Q- I n semi-dilute solution, however, the dissoci- 
ation rate has power law dependence on A, with power 
equal to either 2 or 4/3. 
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FIG. 6: "Phase diagram" for the dissociation rate under the 
influence of inter DNA transfer. 



III. DNA IS A GLOBULE 

After exploring the particular role of DNA motion and 
protein intersegment transfer for short DNA pieces, we 
are well prepared to generalize the above results to the 
more realistic case of globular DNA (see Fig. [TJ. We 
focus here on cases with large y, where the mechanism 
of intersegment transfer is important. The results for 
acceleration rate at small y, where interse gme nt transfer 
does not help much, can be found in Ref. 

We assume that within some volume v, a double he- 
lical DNA with contour length L and persistence length 
p ^> b is confined. We disregard the excluded volume of 
DNA, considering the DNA coil to be Gaussian and not 
a swollen coil, described by the Flory index 3/5. This 
is a reasonable approximation for most realistic cases. 
Indeed, for many real DNA molecules such as A-DNA, 
it is justified because of the large persistence length-to- 
diameter ratio of the double helix: excluded volume in 
the coil remains unimportant up to DNA length about 
L < p 3 /b 2 (as much as 100000 base pairs under nor- 
mal ionic conditions). When the DNA is very long for 
a given volume, specifically, when the gaussian coil size 
(hp) 1 ! 2 > v 1 / 3 , it cannot remain a Gaussian coil, but 
must fold back to make several smaller overlapping coils. 
In other words, it must be a globule which locally resem- 
bles a transient network. 



In order to simplify our calculation, we can approxi- 
mate the DNA as a series of freely-jointed straight seg- 
ments (rods), each with persistence length p. We further 
restrict our study to a globule so dense that the spheres 
containing each rod strongly overlap (Fig. [l}. Except for 
the connectivity, the globule is quite similar to a semi- 
dilute solution of short straight DNA pieces of length p 
and concentration A = (L/p)/v satisfying Np 3 > 1. In 
this case the diffusion distance r p for a given rod to expe- 
rience its first collision with another rod, which may be 
close in space but far removed along the DNA contour, 
is shorter than its length p. As a result, one can disre- 
gard the correlation of motion between connected rods 
and treat the motion of each rod over the short distance 
r p separately as a normal diffusion process with diffusion 
coefficient D(b/p). 

Let us first look at a simple case where the ID sliding 
distance x for a protein on a single DNA rod is shorter 
than the chain length p. As before, we consider x to be 
the distance traveled by the protein within a time r*, the 
average waiting time before a protein is transferred from 
one DNA rod to another, uncorrelated rod. In this sit- 
uation, the protein does not feel the connection between 
rods. Therefore, we can simply use the result for short 
DNA pieces, replacing the length I by p and using the 
rod concentration A = (L/p)/v. Then Eq. (fT3|) gives 
the search time 

t ~ {Npb 2 )^ 3 (v/Db) ~ {Lb 2 /v)^ 3 t s . (14) 

From Eq. dH]), we find r t - 1/ D(N 2 p 2 b) 2 ' 3 and thus 
x ~ (Drt) 1 / 2 ~ 1/(AV&) 1/3 - So tne condition x < p 
is fulfilled when Np 3 > [p/b) 1 ' 2 or L > (v/b 2 )(b/p) 3 / 2 . 
Furthermore, to avoid the liquid crystalline nematic or- 
dering of DNA chains, we assume that Ap 3 < p/b or 
L < v/pb. 

When the concentration of DNA rods is small enough 
that it falls within the range 1 < Np 3 < (p/b) 1 / 2 , the sep- 
aration between DNA rods becomes large. Therefore, the 
time between collisions increases. As a result, the trans- 
fer waiting time Ti grows and the ID sliding distance 
of the protein x becomes larger than p. In this case, 
one should be careful in calculating the DNA diffusion 
distance that results in the first collision between DNA 
rods. It is no longer equal to the nearest neighbor dis- 
tance r v ~ 1/Np 2 between DNA rods of length p. To find 
this distance, let us concentrate on the continuous piece 
of length x > p, which spans several rods. The short- 
est distance from this piece of DNA to another similar 
piece is realized at only one of its constituent rods. The 
first collision that could result in transfer of the protein 
happens only when this particular rod diffuses over the 
x-dependent nearest neighbor distance r(x) ~ r p p/x ~ 
1/Npx. During time t(x) ~ r 2 (x)/D(b/p) 1 on average 
each DNA piece of length x experiences a collision, and 
the protein slides a distance x across the DNA. Thus, 
the waiting time for a protein to be transferred to an- 
other, uncorrelated DNA piece r t ~ t(x) ~ 1/ DN 2 x 2 pb 
should be equal to the ID sliding time x 2 / D of the pro- 
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tein on a single piece. This self-consistent calculation 
gives x ~ (l/iVp) 1 /2( ? ,/6)i/4 and T( _ (i/DNp)(p/b)^ 2 . 
We therefore obtain the search time 

L ,„ 3M/a ^\ 3/4 « (Lb 2 \ l/2 /pxi/4 

(15) 

As explained in Ref. [12j |. without intersegment trans- 
fer, large values of y result in the protein spending 
most of its time adsorbed on DNA far from the target 
site. The result is that the search time saturates at 
L 2 /D ~ (L 2 b/v)t s , which is a huge deceleration com- 
pared to the Smoluchowski time. From Eqs. (fT4)) and 
(fT5|) . one can easily find that at large y the search time 
is greatly reduced below t s by the combination of ID 
sliding, intersegment transfer and DNA motion. Corre- 
spondingly, the acceleration rate is enhanced and can be 
larger than 1, as shown by the solid line in Fig. [5] These 
results remain qualitatively correct for a sparser globule 
with p/v 1 / 3 < Np 3 < 1 or v 2 ! 3 jp < L < v/p 2 , where 
the typical mesh size of the transient network is longer 
than p and thus the piece of DNA inside each mesh is not 
straight as shown in Fig. Q]but rather a small Gaussian 
coil. To fully account for this kind of geometry, however, 
one should consider a more complicated correlated seg- 
mental diffusion of DNA, and this is beyond the scope of 
the current paper. 

Until now, we assumed that D\ — D% — D, where 
D\ and D3 are the diffusion coefficients of protein on 
DNA and in water, respectively. In fact, the random 
sequence of DNA and the resulting sequence-dependent 
nonspecific adsorption energy most likely produces D± < 
D3. To illustrate the role of ID sliding in conjunction 
with intersegment transfer, we fix D3 = D and calculate 
the acceleration rate for various values of D\ following 
the methods explained above. The results for large y, 
where the intersegment transfer plays an important role, 
are shown in the plane of D±/D and Np 3 in Fig. [71 The 
dashed line corresponds to D\ = D% = D. We find that 
the acceleration rate grows as {D\/D) with the index S 
increasing from 1/2 to 1. 



IV. DISCUSSION 

In our theory, we completely neglect the effect of the 
energy barrier e* associated with breaking the double- 
bound state and reverting to a single-bound state. Our 
results are therefore an upper estimate of the effect of 
intersegment transfer. On the other hand, a naive guess 
of the barrier height is e* = e, since to break one of the 
two contacts the protein has to pay the adsorption energy 
per binding site on one side. If this were true, the pro- 
tein would be trapped in the double-bound state for the 
adsorption time t a , and therefore the inter-DNA transfer 
could not do better job in accelerating the dissociation 
of the protein from the protein-DNA complex than des- 
orption into water. As a result, adding DNA into the 




FIG. 7: "Phase diagram" for the acceleration rate at large y 
on globular DNA, where the intersegment transfer plays an 
important role in the target search. 



solution of a protein-DNA complex would not increase 
the dissociation rate of the protein, which clearly con- 
tradicts the in vitro experiments on various proteins and 
DNA molecules HlfilQ. 

This suggests that in the double- 
bound state, the binding strength per binding site e* < e, 
which could be a result of the excluded volume of close 
DNA molecules or the Coulomb repulsion between them. 

The experiment Q showed that the dissociation rate 
increases linearly with the nonspecific DNA concentra- 
tion and saturates at large concentrations. This implies 
that at small DNA concentration, the dissociation rate 
is limited by the diffusion of nonspecific DNA molecules 
and the resulting collisions that induce inter-DNA trans- 
fer. As the DNA concentration is increased, the energy 
barrier for releasing the protein from the double-bound 
state becomes the bottleneck of the dissociation. Since 
the lifetime of the double-bound state does not depend 
on the nonspecific DNA concentration, the dissociation 
rate saturates. Having e* < e in mind, one can show that 
our theory is valid if the transfer waiting time r t is larger 
than the lifetime of the double-bound state. We can es- 
timate this lifetime as the product of the characteristic 
time scale b 2 /D and the binding strength per site in the 
double-bound state y* ~ exp(e*//csT). Thus our theory 
works when y*(b 2 /D) < r t . When y*(b 2 /D) > rt, our 
main idea is still correct, however one should replace rt 
by the lifetime of the double-bound state (b 2 /D)y* and 
repeat a similar analysis. The acceleration rate will be 
diminished as a result but will remain much larger than 
in the case without intersegment transfer. 

The above discussion of e* assumes that the double- 
bound state does not affect the equilibrium Boltzmann 
statistics represented by Eq. This places an addi- 

tional restriction on e* . The energy of the double-bound 
state is 2e*. If one were to take a snapshot of the solution 
of short DNA pieces at a given time, the number of DNA 
contacts (where two DNA collide) per DNA strand is 
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on the order of Nl 2 b, where l 2 b represents the excluded 
volume of a rod-like DNA. Then the limitation on y* 
can be expressed as exp(2e*/fcsT)(M 2 6)(6 3 ) < ylb 2 or 
(y*) 2 < y/Nlb 2 . 

Let us now compare our work with the treatment of 
intersegment transfer in Ref. While our work com- 
bines both mechanisms of ID sliding and intersegment 
transfer, the Ref. [4[ treats them separately. Neglecting 
the mechanism of protein sliding in a description of in- 
tersegment transfer results in a huge overestimation of 
the collision time r and the subsequent transfer time T t . 
Indeed one can see from Fig. [7] that if the protein can- 
not move on DNA, the acceleration rate is b/p, which is 
much smaller than the acceleration rate at D\ = D3 = D. 
Equivalently, neglecting intersegment transfer results in 
overestimation of the sliding time, sliding distance and 
search time. In the later review of Ref. [l8|], the in- 
terplay between sliding and intersegment transfer was 
taken into account. Qualitatively the conclusions agree 
with our results, however the dependence of the inter- 
segment transfer rate on the characteristics of DNA ge- 
ometry, DNA motion, concentration and the nonspecific 
adsorption strength of protein to DNA y was not calcu- 
lated. 

Finally, we note that our theory can be easily adapted 
to study the effective diffusion rate of a protein through 
a solution of polymers like DNA. This problem was stud- 
ied in Ref. [13], assuming D\ — 0. Following the ideas 
of our paper, one can expand on this study to account 
for the "constructive interference" of ID sliding and in- 
tersegment transfer of protein, which was not addressed 
in Ref. [l7|. As with target search, intersegment trans- 
fer enhances the macroscopic diffusion coefficient of pro- 
teins at large y, where the protein spends most of its 
time adsorbed on DNA. We can consider a solution of 
short DNA molecules, where without intersegment trans- 
fer the effective diffusion coefficient of the protein is de- 
creased by nonspecific adsorption to DNA and eventu- 
ally saturates at the DNA diffusion coefficient D(b/l). In 



a dilute solution, intersegment transfer does not assist 
the macroscopic diffusion of proteins, since each DNA 
molecule is far removed from other molecules and there- 
fore the macroscopic displacement of protein is deter- 
mined mainly by the motion of the DNA. In a semidilutc 
solution, however, where 1 < Nl 3 < l/b, ID sliding on 
DNA becomes important. When 1 < Nl 3 < (l/b) 1 / 2 , 
Dr t > I 2 and the ID sliding distance of protein during 
time Tt is limited to the length of DNA /. In this case 
r t ~ r. Using Eq. ©, the effective diffusion coefficient 
is obtained as l 2 /r t ~ N 2 l 6 D(b/l). At higher densities 
when (l/b) 1 / 2 < Nl 3 < l/b, Dr t < I 2 . As a result, the 
nonspecific adsorption of protein on DNA does not hin- 
der the diffusion of protein at all and the macroscopic 
diffusion coefficient is just D. For D\ ^ D3, a similar 
analysis can be performed. 

One further application of our theory is to the problem 
of dynamic (stirred) percolation, e.g., the conductivity 
of well-conducting wires in some insulating liquid. It is 
well known that if the wires are randomly frozen in the 
liquid, the conductivity vanishes below the percolation 
threshold [l9[ . However, because of the diffusion of wires 
in the liquid, the charge carriers are not trapped within 
finite clusters of wires. Instead, they can hop from one 
wire to another when the wires approach close to each 
other. This results in a finite conductivity below the 
percolation threshold [2(j. For such systems, one can 
find the macroscopic diffusion coefficient of the charge 
carriers and then map it to the effective conductivity of 
the system. 
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